SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Stocco, Douglas M. 

Clark, Barbara J. 

(11) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR REGULATION 0 

STEROIDOGENESIS 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Akin, Gump, Strauss, Hauer & Feld, L.L.P. 

(B) STREET: 1900 Frost Bank Plaza, 816 Congress Avenue 

(C) CITY: Austin 

(D) STATE: TX 

(E) COUNTRY: U.S.A. 

(F) ZIP: 78701 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Unknown 

(B) PILING DATE: Concurrently Herewith 

(C) CLASSIFICATION: Unknown. 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Mayfield, Denise L. 

(B) REGISTRATION NUMBER: 33,732 

(C) REFERENCE /DOCKET NUMBER: 04 3 375 . 0006 /DLM 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 512/499-6200 

(B) TELEFAX: 512/499-6290 



5 



{2} INFORMATION FOR SEQ ID NO : 1 : 



(ii SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 



GTCGACCCAC GCGTCCGCTC AGGACCTTGA AAGGCTCAGG AAGAACAACC CTTGAGCACC 
TCAGCACTCA GCATGTTCCT CGCTACGTTC AAGCTGTGTG CTGGAAGCTC CTATAGACAT 



ATGCGGAATA TGAAAGGATT AAGGCACCAA GCTGTGCTGG CCATTGGCCA AGAGCTCAAC 
TGGAGAGCAC TGGGGGATTC CAGTCCCGGG TGGATGGGTC AAGTTCGACG TCGGAGCTCT 
CTGCTTGGTT CTCAACTGGA AGCAACACTC TATAGTGACC AGGAGCTGTC CTACATCCAG 
CAGGGAGAGG TGGCTATQCA GAAGGCCTTG GGCATACTCA ACAACCAGGA AGGCTGGAAG 
AAGGAAAGCC AGCAGGAGAA CGGGGACGAA GTGCTAAGTA AGATGGTGCC AGATGTGGGC 



AAGGTGTTTC GCTTGGAGGT GGTGGTAGAC CAGCCCATGG ACAGACTCTA TGAAGAACTT 
GTGGACCGCA TGGAGGCCAT GGGAGAGTGG AACCCAAATG TCAAGGAGAT CAAGGTCCTG 
CAGAGGATTG GAAAAGACAC GGTCATCACT CATGAGCTGG CTGCGGCGGC AGCAGGCAAC 



15 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



20 



30 
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660 



10 



CTGGTGGGGC CTCGAGACTT CGTGAGCGTG CGCTGTACCA AGCGCAGAGG TTCCACCTGT 

GTGCTGGCAG GCATGGCCAC ACATTTTGGG GAGATGCCGG AGCAGAGTGG TGTCATCAGA 72 0 

5 GCTGAACACG GCCCCACCTG CATGGTGCTT CATCCACTGG CTGGAAGTCC CTCCAAGACT 7 80 

AAACTCACTT GGCTGCTCAG TATTGACCTG AAGGGGTGGC TGCCGAAGAC AATCATCAAC 84 0 

CAGGTCCTAT CGCAGACCCA GATAGAGTTC GCCAACCACC TGCGCAAGCG CCTGGAAGCC 900 

AGCCCTGCCT CTGAGGCCCA GTGTTAAGGA CTGTCCACCA CATTGACCTG CAAATCATTG 960 

GAAGCTCTCA CAGGAAGCCT GCAAGTCTGT CCATCTTCAG CTAACAGCAT CGGGAGGGGT 102 0 

5 GGTAGTCAGG AGACACTAGG ACTGACTGGT AAAATCAGGA TCAGCAAAAT AGAAATGAGG 108 0 

CTTAGAATAA AAGTTCTCTA GTGTCTCCCA CTGCATAGCT GTGAAGGCTA AGGGATAAGT 1140 

AGCTATGAAA CCTTTCATCT AGGCTTGTAT ATGCTGACCT AAAAGACACC AGCAGCTACG 1200 

AACAGGGGAT GCTAAGGATC GGGAACTGTT GTCTTACCAG CTCCAAATGT CACTACCTGA 1260 

AGGCAGTGTG CACACAAAGC AAGGTCTTGC CTAGGAAACT CTGTAAAAGT TCTCCTCTGT 132 0 
25 AAAAGGCCAG AACTTGAATQ AAACTACCTA CAAAGGGCCT TTCCAGAGTA TTCCAACTTT 

TCTCTGAGGA GAAATGAAAC CATCATTGTG CCGACTTCCC TACTAATCCC ATGACAATAA 1440 

AGAACATACA TAAAAAAAAA AAAAAA 1466 

30 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 284 amino acida 
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20 



1380 



# 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Phe Leu Ala Thr Phe Lys Leu Cys Ala Gly Ser Ser Tyr Arg His 
15 10 15 

Met Arg Asn Met Lys Gly Leu Arg His Gin Ala Val Leu Ala He Gly 
20 25 30 

Gin Glu Leu Asn Trp Arg Ala Leu Gly Asp Ser Ser Pro Gly Trp Met 
35 40 45 

Gly Gin Val Arg Arg Arg Ser Ser Leu Leu Gly Ser Gin Leu Glu Ala 
50 55 60 

Thr Leu Tyr Ser Asp Gin Glu Leu Ser Tyr He Gin Gin Gly Glu Val 
65 70 75 80 

Ala Met Gin Lys Ala Leu Gly He Leu Asn Asn Gin Glu Gly Trp Lys 
85 90 95 

Lys Glu Ser Gin Gin Glu Asn Gly Asp Glu Val Leu Ser Lys Met Val 

100 105 110 

Pro Asp Val Gly Lys Val Phe Arg Leu Glu Val Val Val Asp Gin Pro 
115 120 125 

Met Asp Arg Leu Tyr Glu Glu Leu Val Asp Arg Met Glu Ala Met Gly 
130 135 140 
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Glu Trp Asn Pro Asn Val Lys Glu He Lys Val Leu Gin Arg He Gly 
145 150 155 160 

Lys Asp Thr Val He Thr His Glu Leu Ala Ala Ala Ala Ala Gly Asn 

165 170 175 

Leu Val Gly Pro Arg Asp Phe Val Ser Val Arg Cys Thr Lys Arg A_rg 
180 185 190 

Gly Ser Thr Cys Val Leu Ala Gly Met Ala Thr His Phe Gly Glu Met 
195 200 205 

Pro Glu Gin Ser Gly Val He Arg Ala Glu His Gly Pro Thr Cys Met 
210 215 220 

Val Leu His Pro Leu Ala Gly Ser Pro Ser Lys Thr Lys Leu Thr Trp 
225 230 235 240 

Leu Leu Ser He Asp Leu Lys Gly Trp Leu Pro Lys Thr He He Asn 
245 250 255 

Gin Val Leu Ser Gin Thr Gin He Glu Phe Ala Asn His Leu Arg Lys 

260 265 270 

Arg Leu Glu Ala Ser Pro Ala Ser Glu Ala Gin Cys 

275 280 

INFORMATION FOR SEQ ID NO; 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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m 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

5 Ala Glu His Gly Pro Thr Cys Met Val Leu His Pro Leu Ala 

15 10 

(2) INFORMATION FOR SEQ ID NO : 4 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

20 Ala Leu Gly lie Leu Asn Asn Gin Glu Gly Trp Lys 

15 10 



(2) INFORMATION FOR SEQ ID NO : 5 : 



25 



(i) SEQUKNCB CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

35 
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Gly Ser Thr Cys Val Leu Ala Gly 
1 5 

Pro Glu Gin 



Met Ala Thr His Phe Gly Glu Met 
10 15 



(2) INFORMATION FOR SEQ ID NO: 6: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

20 Asn Gin Glu Gly Trp Lys 

1 5 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



-86- 



Ala Glu His Gly Pro Thr Cys Met Val 
1 5 



5 (2) INFORMATION FOR SEQ ID NO : 8 : 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

lie Leu Asn Asn Gin Glu Gly Trp Lys Lys Glu 
15 10 

20 (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLKCULB TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 

30 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(3, 12, 15, 18) 

(D) OTHER INFORMATION: /mod_base« OTHER 
35 /note- "N - (A or C or G or T/U) or (unknown or other) " 
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(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base» OTHER 
5 /note- "R = A or G" 

' iX } FEATURE : 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(9, 21) 

10 (D) OTHER INFORMATION: /mod^base- OTHER 

/note- "Y = C or T/U" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

15 

GCNGARCAYG GNCCNACNTG YATGG 



(2) INFORMATION FOR SEQ ID NO: 10: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



30 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(5, 17) 

(D) OTHER INFORMATION: /mod_base- OTHER 
/note- "R « A or G" 



35 (ix) FEATURE: 
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(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(8, 11, 14, 23) 

(D) OTHER INFORMATION: /mod_base« OTHER 
/note« "N = (A or C or G or T/U) or {unknown or other)" 

5 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_^base 

(B) LOCATION: 20 

(D) OTHER INFORMATION: /mod_base« OTHER 
10 /note- "Y a C or T/U" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCATRCANGT NGGNCCRTGY TCNGC 

15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLBCULB TYPB: other nucleic acid 

(A) DESCRIPTION: /desc - "DNA" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 
30 (B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base« OTHER 
/note- "Y » C or T/U" 

(ix) FEATURE: 
35 (A) NAME/KEY: modified base 
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10 



15 



(B) LOCATION: one-of{6, 9) 
(D) OTHER INFORMATION: /mod_base= OTHER 
/notea "R » A or G" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_ba3e 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /modjoase- OTHER 
/note= "N (A or C or G or T/U) or (unknown or other) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AAYCARCARG GNTGGAA ^-j 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
25 (A) DESCRIPTION: /desc . "DNA" 

(xi) SEQUENCB DESCRIPTION: SEQ ID NO: 12: 

TTCCANCCYT CYTGRTT 

30 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 401 base pairs 



-90- 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



60 



5 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

0 AACCAGGAAG GCTGGAAGAA GGAAAGCCAG CAGGAGAACG GGGACGAAGT GCTAAGTAAG 

ATGGTGCCAG ATGTGGGCAA GGTGTTTCGC TTGGAGGTGG TGGTAGACCA GCCCATGGAC 120 

AGACTCTATG AAGAACTTGT GGACCGCATG GAGGCCATGG GAGAGTGGAA CCCAAATGTC 18 0 

AAGGAGATCA AGGTCCTGCA GAGGATTGGA AAAGACACGG TCATCACTCA TGAGCTGGCT 240 

GCGGCGGCAG CAGGCAACCT GGTGGGGCCT CGAGACTTCG TGAGCGTGCG CTGTACCAAG 3 00 

20 CGCAGAGGTT CCACCTGTGT GCTGGCAGGC ATGGCCACAC ATTTTGGGGA GATGCCGGAG 360 

CAGAGTGGTG TCATCAGAGC TGAACACGGC CCCACCTGCA T 401 



25 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc « "RNA" 

35 
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# # 



10 



20 



120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GUCGACCCAC GCGUCCGCUC AGGACCUUGA AAGGCUCAGG AAGAACAACC CUUGAGCACC 6 0 
5 UCAGCACUCA GCAUGUUCCU CGCUACGUUC AAGCUGUGUG CUGGAAGCUC CUAUAGACAU 

AUGCGGAAUA UGAAAGGAUU AAGGCACCAA GCUGUGOJGG CCAUUGGCCA AGAGCUCAAC ISO 

UGGAGAGCAC UGGGGGAUUC CAGUCCCGGG UGGAUGGGUC AAGUUCGACG UCGGAGCUCU 24 0 

CUGCUUGGUU CUCAACUGGA AGCAACACUC UAUAGUGACC AGGAGCUGUC CUACAUCCAG 3 00 

CAGGGAGAGG UGGCUAUGCA GAAGGCCUUG GGCAUACUCA ACAACCAGGA AGGCUGGAAG 360 

15 AAGGAAAGCC AGCAGGAGAA CGGGGACGAA GUGCUAAGUA AGAUGGUGCC AGAUGUGGGC 42 0 

AAGGUGXraCJC GCUUGGAGGU GGUGGUAGAC CAGCCCAUGG ACAGAOJCUA UGAAGAACUU 480 

GUGGACCGCA UGGAGGCCAU GGGAGAGUGG AACCCAAAUG UCAAGGAGAU CAAGGUCCUG 540 

CAGAGGAUUG GAAAAGACAC GGUCAUCACU CAUGAGCUGG CUGCGGCGGC AGCAGGCAAC 600 

CUGGUGGGGC CUCGAGACUU CGUGAGCGUQ CGCUGUACCA AGCGCAGAGG UUCCACCUGU 660 

25 GUGCUGGCAG GCAUGGCCAC ACAUUUUGGG GAGAUGCCGG AGCAGAGUGG UGUCAUCAGA 720 

GCUGAACACa GCCCCACCUQ CAUGGUGCUU CAUCCACUGG CUGGAAGUCC CUCCAAGACU 780 

AAACUCACUU GGCUGCUCAG UAUUGACCUG AAGGGGUGGC UGCCGAAGAC AAUCAUCAAC 840 

30 

CAGGUCCUAU CGCAGACCCA GAUAGAGXKJC GCCAACCACC UGCGCAAGCG CCUGGAAGCC 900 

AGCCCUGCOJ CUGAGGCCCA GUGUUAAGGA CUGUCCACCA CAUUGACCUG CAAAUCAUUG 96 0 
35 GAAGCUCUCA CAGGAAGCCU GCAAGUCUGU CCAUCUUCAG CUAACAGCAU CGGGAGGGGU 



1020 
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• # 



10 



GGUAGUCAGG AGACACUAGG ACUGACUGGU AAAAUCAGGA UCAGCAAAAU AGAAAUGAGG 10 80 

CUUAGAAUAA AAGUUCUCUA GUGUCUCCCA CUGCAUAGCU GUGAAGGCUA AGGGAUAAGU 114 0 

5 AGCUAUGAAA CCUUUCAUCU AGGCUUGUAU AUGCUGACCU AAAAGACACC AGCAGCUACG 12 0 0 

AACAGGGGAU GCUAAGGAUC GGGAACUGUU GUCUUACCAG CUCCAAAUGU CACUACCUGA 126 0 

AGGCAGUGUG CACACAAAGC AAGGUCUUGC CUAGGAAACU CUGUAAAAGU UCUCCUCUGU 132 0 

AAAAGGCCAG AACUUGAAUG AAACUACCUA CAAAGGGCCU UUCCAGAGUA UUCCAACUUU 1380 

UCUCUGAGGA GAAAUGAAAC CAUCAUUGUG CCGACUUCCC UACUAAUCCC AUGACAAUAA 144 0 

15 AGAACAUACA UAAAAAAAAA AAAAAA 14gg 



(2) INFORMATION FOR SEQ ID NO: 15: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 



25 



30 



(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



267 GTGGATTAAC CAGGTTCGGC GGCGGAGCTC TCTACTCGGT TCTCGGCTGG AAGAGACTCT 326 
327 CTACAGTGAC CAGGAGCTGG CCTATCTCCA GCAGGGGGAG GAGGCCATGC AGAAGGCCTT 3 86 
38 7 GGGCATCCTT AGCAACCAAG AGGGCTGGAA GAAGGAGAGT CAGCAGGACA ATGGGGACAA 446 
447 AGTGATGAGT AAAGTGGTCC CAGATGTGGG CAAGGTGTTC CGGCTGGAGG TCGTGGTGGA 506 
35 507 CCAGCCCATG GAGAGGCTCT ATGAAGAGCT CGTGGAGCGC ATGGAAGCAA TGGGGGAGTG 566 
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567 


GAACCCCAAT 


GTCAAGGAGA 


TCAAGGTCCT 


GCAGAAGATC 


GGAAAAGATA 


CATTCATTAC 


626 


627 


TCACGAGCTG 


GCTGCCGAGG 


CAGCAGGAAA 


CCTGGTGGGG 


CCCCGTGACT 


TTGTGAGCGT 


686 


687 


GCGCTGTGCC 


AAGCGCCGAG 


GCTCCACCTG 


TGTGCTGGCT 


GGCATGGACA 


CAGACTTCGG 


746 


747 


GAACATGCCT 


GAGCAGAAGG 


GTGTCATCAG 


GGCGGAGCAC 


GGTCCCACTT 


GCATGGTGCT 


806 


807 


TCACCCGTTG 


GCTGGAAGTC 


CCTCTAAGAC 


CAAACTTACG 


TGGCTACTCA 


GCATCGACCT 


866 


867 


CAAGGGGTGG 


CTGCCCAAGA 


GCATCATCAA 


CCAGGTCCTG 


TCCCAGACCC 


AGGTGGATTT 


926 


927 


TGCCAACCAC 


CTGCGCAAGC 


GCCTGGAGTC 


CCACCCTGCC 


TCTGAAGCCA 


GGTGTTGAAG 


986 


987 


AC 988 















(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

127 ATGCTGCTAQ CGACATTCAA GCTGTGCGCT GGGAGCTCCT ACAGACACAT GCGCAACATG 186 
187 AAGGGGCTGA GGCAACAGGC TGTGATGGCC ATCAGCCAGO AGCTGAACCG GAGGGCCCTG 246 
247 GGGGGCCCCA CCCC 260 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 
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# # 



5 



10 



(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

1051 ACTGGAAGCC TGCAAGTCT 106 9 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 
15 (D) TOPOLOGY: 



(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 



20 SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



1 MLLATFKLCA GSSYRHMRNM KGLRQQAVMA ISQELNRRAL GGPTPSTWIN 50 

51 QVRRRSSLLG SRLEETLYSD QELAYLQQGB EAMQKALGIL SNQEGWKKES 100 

101 QQDNGDKVMS KWPDVGKVF RLEVWDQPM ERLYEELVER MEAMGEWNPN 150 

151 VKEIKVLQKI GKDTPITHEL AAEAAGNLVG PRDFVSVRCA KRRGSTCVLA 200 

201 GMATDFGNMP EQKGVIRABH GPTCMVLHPL AGSPSKTKLT WLLSIDLKGW 250 

251 LPKSIINQVL SQTQVDFANHL RKRLESHPA SEARC 285 



30 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 
35 (D) TOPOLOGY: 
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(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID 

agaacaccag gtccaggctg cagctgcggg 
aggacgaaga accacccttg agagaagagg 
gaccccacca ctgccacatt tgccaggaaa 
CTGGGAGCTC CTACAGACAC ATGCGCAACA 
CCATCAGCCA GGAGCTGAAC CGGAGGGCCC 
ACCAGGTTCG GCGGCGGAGC TCTCTACTCG 
ACCAGGAGCT GGCCTATCTC CAGCAGGGGG 
TTAGCAACCA AGAGGGCTGG AAGAAGGAGA 
GTAAAGTGGT CCCAGATGTG GGCAAGGTGT 
TGGAGAGGCT CTATGAAGAG CTCGTGGAGC 
ATGTCAAGGA GATCAAGGTC CTGCAGAAGA 
TGGCTGCCGA GGCAGCAGGA AACCTGGTGG 
CCAAGCGCCG AGGCTCCACC TGTGTGCTGG 
CTGAGCAGAA GGGTGTCATC AGGGCGGAGC 
TGGCTGGAAG TCCCTCTAAG ACCAAACTTA 
GGCTGCCCAA GAGCATCATC AACCAGGTCC 
ACCTGCGCAA GCGCCTGGAQ TCCCACCCTG 
gctgttccca actgtgccca gctgcactgg 
gaagcctgca agtctaagat ctccatctgg 
gagtatgaca ctaggattca gattggtgaa 
ctcttggatt aaaaggtaac ttcattcact 
ctaaaaataa ttgtaaaact ttttttctgg 
aaaatgctag tggctgatat gggtgtgggg 
ctttatgggc tcaagaatgc catgcgctgg 
gggtctcctg cagccctctg ctcctcccgg 
gcaccccaca gtgggtgcct tccagaaata 
aaactcatta gtagacatgt ttccctattg 
cataattcac acaaaaaaaa a 



NO: 19: 

actcagaggc gaacgttgag gggctcagga 6 0 
cagcagcagc gcggcagcag cagcggcagc 12 0 
caATGCTGCT AGCGACATTC AAGCTGTGCG 18 0 
TGAAGGGGCT GAGGCAACAG GCTGTGATGG 240 
TGGGGGGCCC CACCCCTAGC ACGTGGATTA 3 00 
GTTCTCGGCT GGAAGAGACT CTCTACAGTG 360 
AGGAGGCCAT GCAGAAGGCC TTGGGCATCC 42 0 
GTCAGCAGGA CAATGGGGAC AAAGTGATGA 480 
TCCGGCTGGA GGTCGTGGTG GACCAGCCCA 54 0 
GCATGGAAGC AATGGGGGAG TGGAACCCCA 600 
TCGGAAAAGA TACATTCATT ACTCACGAGC 66 0 
GGCCCCGTGA CTTTGTGAGC GTGCGCTGTG 720 
CTGGCATGGC CACAGACTTC GGGAACATGC 780 
ACGGTCCCAC TTGCATGGTG CTTCACCCGT 840 
CGTGGCTACT CAGCATCGAC CTCAAGGGGT 900 
TGTCCCAGAC CCAGGTGGAT TTTGCCAACC 960 
CCTCTGAAGC CAGGTGTTGA agaccagcct 1020 
tacacacgct catcaggaga atccctactg 1080 
tgacagtggg atgggtgggg ttcgtgttta 1140 
agtttttagt accaagaaaa cagggatgag 1200 
gattagctat gacatgaggg ttcaggcccg 1260 
gcccttatgt acccacctaa aaccatcttt 1320 
gatgctaacc acagggcctg agaagtcttg 1380 
cagtacatgt gcacaaagca gaatctcaga 1440 
ccgctgcaca gcaacaccac agaacaagca 1500 
tagtccaagc tttctctgtg gaaaaagaca 1560 
ctttcatagg caccagtcag aataaagaat 1620 

1641 
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